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Abstract. We present an expository introduction to orthogonal polynomials 
on the unit circle. 



1. Introduction 

Orthogonal polynomials are the Rodney Dangerfield |108| of analysis. Because 
of the impact of Stieltjes' great 1895 paper on F. Riesz, Nevanlinna, and Hilbert's 
school, the moment problem and the closely related subject of orthogonal polynomi- 
als on the real line (OPRL) were central in the revolution in analysis from 1900-1920 
and provided critical precursors to the Hahn-Banach theorem, the Riesz-Markov 
theorem, the spectral theorem, and the theory of selfadjoint extensions. But in 
recent years, too often the subject is dismissed as "classical" and not worthy of 
further study. 

With developments in random matrix theory and combinatorics (e.g., 0101130 
H2ll43llf?2*ll79| L it is clear that orthogonal polynomials still have a lot to contribute. 
From one point of view, what makes them relevant is that they are the simplest 
of inverse spectral problems — indeed, Gel'fand-Levitan explicitly note that 
their approach to inverse theory for Schrodinger operators is motivated by OPRL. 
Recently, OPUC ideas have provided a matrix realization of Lax pairs for the 
(defocusing) AKNS equation [M] , 

What is true for OPRL is even more true for orthogonal polynomials on the unit 
circle (OPUC). While the closely related area of positive harmonic functions on 
D = {z £ C | \z\ < 1} drew the attention of Caratheodory, Fejer, Herglotz, F. Riesz, 
Schur, and Toeplitz in the 1910's, the subject was only invented by Szego in about 
1920, especially in his deep 1920-1921 paper jHSl- So OPUC never had its era of 
centrality but has had a steady but small following over the years. Traditionally, 
the book references for the subject were Szego's book [§7|, which has only one 
full and several partial chapters on OPUC, Geronimus' book and review |32|. 
and a chapter in Freud |25| . which are very dated. With a major development 
published only in 2003 (the CMV matrix of Sectional below), it is hard not to be 
dated. Motivated by this dearth of review literature and by the opportunity to use 
Schrodinger operator techniques in a new setting, I published two volumes [851181?) 
on the subject. Many friends asked if there wasn't some way to learn about the 
subject in less than 1100 pages, and this expository note is the result. 
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Throughout, we use D for the unit disk in C, and SO for the unit circle. Our inner- 
products, (f,g), are linear in g and antilincar in /. Significant missing material 
involve some explicit examples — these are discussed in Section 1.6 of |88) : my 
favorite are the Rogers-Szego polynomials (Example 1.6.5). This article undergoes 
a kind of phase transition in the middle of Section [5] in that before there, most 
results have proofs or at least sketches given, and afterwards, there aren't many 
proofs. This is because the earlier material is more central and also because the 
later proofs are lengthier. 

To put OPUC in context, recall some basics of OPRL. Since the fascinating 
issues of indeterminate moment problems (see ^IHU) are irrelevant to OPUC, we 
will assume all measures have compact support: 

(1) If p, is a probability measure on C (i.e., positive with /x(C) = 1) with compact 
support and X n (z) are the monic orthogonal polynomials (i.e., X n (z) = z n + 
lower order, X n _L z , i = 0, . . . , n — 1), 

n 

zX n (z) = X n+1 (z)+Y,aj n) Xj(z) (LI) 

3=0 

(„) _ (Xj,zX n ) 
a - ||x . ||2 (1-2) 

What makes OPRL special is that multiplication by x is selfadjoint, so if we 
use P n in place of X n for OPRL and p for p, 

(P jt xP n ) = (xPj,P n ) =0 j = 0, . . . , n - 2 

and thus l|1.2fl becomes 

xP n (x) = P n+ i(x) + b n+1 P n (x) + a^P n -x(x) (1.3) 

for Jacobi parameters, a n , 6„; n = 1, 2, If p n = P n /||P n || are the orthonor- 

mal OPRL, the matrix elements of multiplication by x in p n basis have the 
form: 

(b x oi 
ai b 2 a 2 
a 2 b 3 a 3 .. 

v / 

(2) There is a one-one correspondence between bounded J's (i.e., sup n |a n | + \b n \ < 
oo) and p on M with compact but infinite support. This is sometimes called 
Favard's theorem. 

(3) If A is a bounded selfadjoint operator on a separable Hilbert space, TC, and 
<p is a cyclic unit vector (i.e., {A n ip}^ =0 span TC), one can use the spectral 
theorem to find a measure dp on [— \\A\\, \\A\\] with J x n dp — (ip,A n (p) and 
then the OPRL for this measure to find a semi-infinite Jacobi matrix unitarily 
equivalent to A with ip mapped to (10 ...)*. This realization is unique, 
that is, the a„'s and 6„'s are intrinsic to the pair (A, (p). It was Stone who 
emphasized this point of view that the study of Jacobi matrices was the same 
as the study of selfadjoint operators with a distinguished cyclic vector. 

(4) A key role is played by the Stieltjes transform of p, that is, the function, to, 
on C\supp(dp) given by 

f dp(x) 

m(z) — / (1.5) 



J 



(1.4) 
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(5) The Jacobi parameters can also be captured from m(z) via a continued fraction 
expansion (of Stieltjes) at oo: 

1 

m(z) = ^ (1.6) 

af 

~ z + b i —r 2 — 

— Z + 2 - &2 . . . 

We will not discuss applications of OPUC in detail but note its important appli- 
cations to linear prediction and filtering theory. The basics are due to Wiener |1U7| , 
Kolmogorov [5TJ, Krein |S2103], and Levinson |57|. The ideas have been especially 
developed by Kailath [2511361137] . 

The title of this article is based on an incident reported in the Talmud HHJ that 
someone asked the famous first-century rabbi Hillel to describe Judaism to him 
while he stood on one foot. Hillel's answer was: "Do not do unto others that which 
is hateful to you. The rest is commentary. Go forth and study." This article is 
OPUC on one foot. jHEHHH] are commentary. 

It is a pleasure to thank M. Aizenman for pushing me to write such an article. 
I'd like to thank S. Denisov, F. Gesztesy, L. Golinskii, D. Lubinsky, F. Marcellan, 
P. Nevai, and G. Stolz for useful input. This paper was started while I was a 
visitor at the Courant Institute and completed during my stay as a Lady Davis 
Visiting Professor at Hebrew University, Jerusalem. I'd like to thank P. Deift 
and C. Newman for the hospitality of Courant and H. Farkas and Y. Last for the 
hospitality of the Mathematics Institute at Hebrew University. 

2. The Szego Recursion 

OPUC is the study of probability measures on <9B, that is, positive measures, /i, 
with 

fi(dB) = 1 (2.1) 
The Caratheodory function (after ^SJ) of \x is defined on D by 

F ( z ) = I i^M9) (2.2) 

This analog of l|1.5fl is an analytic function on U which obeys 

F(0) = 1 z e O ReF(z) > (2.3) 
The Schur function (after |84j) is then defined by 

F(z) = i±4^ (2.4) 
1 - zf{z) 

and is an analytic function mapping D to D, that is, 

sup|/(z)|<l (2.5) 

(f(z) = e t6a is included and produced by n, a point mass at e t9a ). 

H2.2(l sets up a one-one correspondence between probability measures \i and an- 
alytic functions obeying (|2.3|) — this is essentially a form of the Herglotz represen- 
tation (see PP- 247]) and can be realized via 

dfi = w-lim Re F(re l9 ) ^- (2.6) 
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or by 

oo 

F(z) = l + 2^c„z™ (2.7) 

71=1 

where c n are the moments of [i given by 

- me dfj,(6) (2.8) 



(|2.4|) sets up a bijection between f's obeying (|2.5|l and F's obeying l|2.3|l . 

We call a measure trivial if it is supported on a finite set and nontrivial otherwise. 
We will mainly be interested in nontrivial measures, fi is trivial if and only if its 
Schur function is a finite Blaschke product 

n-i 

/(*) = e ie ° n f^- ( 2 - g ) 

XJ - 1 — 

j=i J 

with 2i, ... , z n -i G D. Here n is the number of points in the support of d\i. Later 
(see the remark after Theorem 17. 1|) we will interpret l|2.9|) in terms of OPUC. 

If jj, is a nontrivial probability measure on 90, we define the monic orthogonal 
polynomials $„(z;d/i) (or $ n (z) if d/i is understood) by: 

$„(z) = z"+ lower order J e- ije $ n (e ie ) dfi{6) = j = 0, 1, 2, . . . , n-1 (2.10) 

so in L 2 (dB,dn), ($ n ,$ m ) = if n ^ to. The orthonormal polynomials are 
defined by 

where || • || is the L 2 -norm. {ip n }^Lo is an orthonormal set in L 2 . It may not be a 
basis (e.g., d[i(6) = d9/2ir where ip n (z) = z n and zi , j = 1, . . . , are orthogonal to 
all <p n ). We will discuss this further below (see Theorem 12. 2JI . 

If d[i is trivial, say supp(d/i) = {zj}j =1 , we can still define <&„,</?„ for n = 
0, 1, . . . , k — 1. We can even define (but not as the unique monic polynomial 
of degree k with ||$k|| = 0, that is, 

A: 

$fc(z) = ]J(z - Zj) (/i trivial) (2.12) 
j=i 

Clearly, (|2.10(l and the fact that the polynomials of degree at most n have di- 
mension n + 1 implies 

deg(P) < n, Flz 3 , j = 0, . . . ,n - 1 =>■ P = c$„ (2.13) 

On L 2 (9D, d/i), define the anti-unitary map, *' n , by 



/*'"(e ie ) = e me f(e ie ) (2.14) 
One mainly considers *'" on the set of polynomials of degree n which is left invariant: 

n n 

P(z) =J2 c i* j => P *' n ( z ) =J2 d ^ j = zn WF) (2-15) 
j=o 3=0 
Henceforth, following a standard, but unfortunate, convention, we drop the " , n" 
and just use P* , hoping the n is implicit. Note that 1* = z n , depending on n\ 



OPUC ON ONE FOOT 



5 



Since * is anti-unitary, Ij2.13|) implies 

deg{P)<n, P±z j , j = l,...,n=>P = c$* (2.16) 

Since (/, zg) = (z~ 1 f, g), it is easy to see that $„ + i — z$„ _L z J for j = 1, 2, . . . , n. 
Since <!> is monic, this difference is of degree n, so Ij2.16|l implies 

$„+i(z) = z$ n (z) - a n $* n (z) (2.17) 

for complex numbers a n , called the Verblunsky coefficients (in the older literature, 
also called reflection, Schur, Szego, or Geronimus coefficients). (|2.17|) is called Szego 
recursion after its first occurrence in Szego's book |97| . In the engineering literature, 
it is called the Levinson algorithm after its rediscovery in linear prediction theory 
|57| . The choice of minus and a n rather than a n will be made clear by Geronimus' 
theorem (see Theorem |OJ. Since $ n is monic, (|2.15|) implies <&* (0) = 1, so (|2.17l) 
at z = implies 

a» = -$„+i(0) (2.18) 

Theorem 2.1. We have 

||$ ti+1 || 2 = (l-| a „| 2 )||$ n || 2 (2.19) 

II^IHII^-KI 2 ) 172 ( 2 - 2 °) 

For any nontrivial [i, we have aj (dfi) € D for all j . If \x is trivial with n points in 
its support, then ao(dfi), . . . , a n -2(dfi) € D and a„_i(d/i) € 9D. 

Proof. (|2.17() . unitarity of multiplication by z, and $* _L imply 

||$„|| 2 = ||z$„|| 2 = ||$ n+1 + a„$;|| 2 = ||$, i+ i|| 2 + K| 2 ||$„|| 2 

which implies l|2.19|l . Induction and <& = 1 implies (|2.2U|) . By (|2.19|) . |aj| < 1 in the 



nontrivial case and for j = 0, . . . , n— 2 in the trivial case. Since ||$n|| = ^ ||$ n _i|| 
in the trivial case, l|2.19|l implies = 1. □ 

Since it arises often, we define 

p, = (i-N 2 ) 1/a k-i 2 +p 2 = i (2.2i) 



One can use <|2.2L)[1 to relate completeness of {VnjJ^Lo *° the Verblunsky coeffi- 
cients: 

Theorem 2.2. For any nontrivial measure, the following are equivalent: 

(a) lim^oo ||$ n || =0 

( b ) E^LoKf = 00 

(c) {<p n }??=o are a basis for L 2 (dJ]>,dfj,) 

Remark. We will see later that there is an additional equivalence via Szego's 
theorem (see (|8 .0|) . The equivalence of a Szego condition to completeness is due 
to Kolmogorov |SJ and Krein |521 153j . 

Sketch. By O, (a) & (b). If 

P [kA = projection in L 2 (dB, rf/i) onto span{z m }^ =fe (2.22) 

we have that 

||<M = ||(l-P M )zl (2.23) 
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= ||(1-P[i,„])l|| (2.24) 

= ||(l-P [ o,„_ 1] )z- 1 || (2.25) 

where (|2.23|) follows from the definition of <!>„, (|2.24() by applying *> n to z n and 
Pro, n -i], and l|2.25[) by using the fact that multiplication by is unitary. It 
follows that 

IKl-^o.oc))^ 1 !! = lim ||$„|| (2.26) 

n — >oc 

so (a) <^> z^ 1 € span{y n }^L . If z^ 1 ^ span{iy9„}^L , clearly they are not complete. 
If z^ 1 6 spa,n{tpj}JL , an argument (see the proof of Theorem 1.5.7 in [HHj) taking 
powers of z _1 shows z~ l £ span{iy9„}^L for all £, so {^p n }^Lo are complete. □ 

Let B 00 ^ denote the set of complex sequences {a.j}jL where either N — oo and 
| ct j | < 1 for all j, or else N < oo and ao, . . . , ctN-i S D while ajy £ dU>. In the 
topology of componentwise convergence, D°°' c is compact (and is a compactification 
of IB 00 ). The map, S, from fj, i— > {aj(dfj,)}jL is a well-defined map from .M+.i(<9B), 
the probability measures on c®, to D°°' c . By (2321, the a ' s determine the $„'s. 
Since /$„(z)c?/i = <5„o, the $ n 's determine the moments inductively, and so dfi, 
since {z £ } 00 = _ 00 span a dense set of C(<9B). Thus S is one-one. Moreover, 

Theorem 2.3 (Verblunsky's Theorem |1U2| ) ■ S is onto. 

[HE] has four proofs of this theorem (Theorems 1.7.11, 3.1.3, 4.1.5, and 4.2.8); 
see Section |3 below. Given that iS is a bijection, it is easy to see that it is a 
homeomorphism if A4+,i(dH) is given the vague (i.e., C(9D)-weak *) topology. 

Applying * (actually, *<™ +1 ) to (|2~TTIl yields 

= (2.27) 

Using (|2~T5|) and (|2~TT|) . we get the recursion relations for tp n written in matrix 
form 

* i \ = A(z,a n )[ (2.28) 
where 

z —a 



A(z,a)=p-^_ za 1 j (2.29) 

Notice that det A = z, so by inverting A, we get inverse recursion relations. We 
note the one for $ n _i: 

$n-l(z) = (2.30) 

Note that, by l|2.18[) . [. . . ] vanishes at zero, so the right side of (|2.30() is a polynomial 
of degree n — 1. This implies: 

Theorem 2.4 (Geronimus Let [i,v be two probability measures on dH> so 

that for some Nq, $>N (z;dfi) = ^^ (z;dh i ). Then <&j(z;dfi) = <$>j(z\dv) for j 

0. 1 Y, , 1 . a>j(dfi) = aj(dv) for j = 0, 1, . . . , No — 1, and <fj(z; dfj,) — <fij(z; dv) 

for j = 0,1,... ,N . 

Remark. As noted in a footnote in Geronimus [2] and rediscovered by Wendroff 
|1()5| . the result for OPRL requires equality for Pn and Pn -i and, in particular, 
it often happens that Pn (x, dj) = PN (x,dp), but no other Pj's are equal. 
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Proof. By (12.18(1 . $at at determines ajv -i, and so pjv -i; and thus $./Vo-i by 
(|2.30|1 . By induction, all ay, j < Nq — 1, and $j, j < Vo, are equal and so, by 
(E2SU), ||$,||, and so <^-. □ 

As a final aspect of Szego recursion, we turn to the Christoffel-Darboux formula 
(proven by Szego \§7\ for OPUC; Christoffel 121 an d Darboux ^8] had a similar 
formula for OPRL), which is an analog of an iterated Wronskian formula for ODE's. 
With A given by l|2.29[) . one finds, by matrix multiplication, that 

AiCa^h 1 °^JA(z,a n )=(-f J) (2.31) 



¥>n+l(0 Vn +1 (z) - Vn+l (C) fn+1 (z) 



- (1 - ZC) <MC) M*) + MO VIM ~ <P*n(0 <Pn(*)] 

which, iterated to n — (where [. . . ] =0), yields 
Theorem 2.5 (Szego [57]; CD Formula for OPUC). 

n 

(1 - <) X! MO M Z ) = <£n+l(0 ^n+l( z ) - fn+liO fn+l{z) (2-32) 
3=0 

If z = C and lie in D, we have various positivity facts that imply (the first since 
(p (z) = 1): 

\<P* n (zo)\ > (1- ko| 2 ) 1/2 forzoGD (2.33) 

OO 

limK +1 (^ )| = oo ^ ^|^-(z )| 2 = oo for z G D (2.34) 

3=0 

3. Verblunsky's and Geronimus' Theorems 

In this section, we will prove Verblunsky's theorem (Theorem I2.3|l and also a 
celebrated theorem of Geronimus. Our approach follows Section 3.1 of [BE, which 
claims a new proof of Geronimus' theorem assuming Verblunsky's theorem. But 
in preparing this article, we realized the argument can be slightly modified to also 
prove Verblunsky's theorem. 

To state Geronimus' theorem, we need to describe the Schur algorithm |84|. 
Given a Schur function, /, define 

7o(/) = /(0) f(z)= ™ + _ Zf l { ?\ (3.1) 

If 70 G 9D (i.e., f(z) = 70), we do not define f\. Otherwise, /1 defined by 1(3. ljl is 
also a Schur function since w — > (70 + w)/(l+ 7ow) is a biholomorphic bijection of 
D to D if I70I < 1, and g a Schur function with g(0) = implies g(z)/z is a Schur 
function (the Schwarz lemma). 

1)3.1(1 is called the Schur algorithm. It can be iterated, that is, we define j n {f), 
the Schur parameters, and f n +i, the Schur iterates, inductively by 

ln{f) = fn(0) /„(*) = 7 "t Z/ ? +l(z) (3.2) 

1 +lnZJ n +l{z) 
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If, for some n, f n (z) — e l0a , we set j n — e i0 ° and stop. In this way, we map any 
Schur function, /, to a sequence in D°° ,c . We can now state Geronimus' theorem: 

Theorem 3.1 (Geronimus' Theorem). Let /i be a probability measure on &D, f its 
Schur function, and 7„(d/i) = 7n(/) the Schur parameters of f. Then 

Jn(d/j.) = a n (dfx) (3.3) 

This gives a continued fraction expansion of F whose coefficients are a n , and so 
is an analog of 1(1. 6|1 . This formula explains why we took a minus and conjugate in 
((2.17(1 . The procedure of dropping a Verbhmsky coefficient from the start can be 
understood by using the recursion relations and the relation of F to the OPUC (see 
Theorem 14.41 below^l. This approach to proving Theorem 13. II due to Peherstorfer 
[fi9"| . is discussed in Section 3.3 of [HHj- 

1)3. 1(1 / ((3.2(1 can be rewritten and then iterated following Schur [HI] : 

f(z) = 70 + (1 - %f)zfi (3.4) 

n— 1 r j— 1 -i 71 — 1 

= 7o + E Ui 1 -^) z j 1j+l[(l-%fk)z n fn (3.5) 
j=i L fc = o -I it=o 

which implies that if f(z) = J2n°=o a n(/) z ™, then 

n-l 

On(/) = In - |7j| 2 ) + polynomial in (70,70, ... ,7n-i,7«-i) (3.6) 

Plugging this into 1(2.4(1 and using 1(2.7(1 implies 

n-2 

c„(d/i) = 7„_i JJ(1 - | Tj | 2 ) + polynomial in (70,70, ■ ■ - ,7n-2,7n-2) (3-7) 
j=o 

(the polynomials are different but the leading terms are the same up to a shift of 
index) . 

1(3.6(1 also shows that if 7j(/) = 7j(<?) for J = 0, . . . , n— 1, then the Schur function 
\{f ~ 9) — 0(z n ) so, by the Schwarz lemma, 

7i(/)=7j(9). J=0,...,n-1=> |/(z)- 5 (z)| < 2|z|" (3.8) 
Lemma 3.2. 27ie map /rom Schur functions to D°°' c is one-one and onto. 

Proof. ((3.8(1 shows that if 7^ (/) = jj (g) for all j , then / = g on B. Given a sequence 
in B°°, define the Schur approximates, fi n \ by setting f^j_ 1 to in ((3.2|) and using 
{7j}"=o t0 define fn\fn-i, ■ ■ ■ J [n] - B y construction, 

l 3 (f ln] ) = h (3-9) 
[0 ]>n 

Since 7j (/'"■') = 7j(/' m ') for J < min(n, m), we have, by ((3.8(1 . that /M converge 
uniformly on compacts and the limit clearly has the prescribed set of 7's. Given a 
sequence in D 00 ^© 00 , suppose 7„+i = e lSo G <9D, set / n+1 = e l6 ° and use to 
define / with the prescribed 7's. □ 
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Proof of Theorems Wlh and UFH (z n - <&„) _L $„ =4> ||$n|| 2 = (z n , so applying 

*,n 

n-1 

($;,i) = ll$„f = n( 1 -K-l 2 ) ( 3 - 10 ) 

3=0 

Taking the inner product of Ij2.17|) with the function, 1, and using (<£>„ + i, 1) = 0, 
we see 

n-1 

(z$ n ,l)=a n Y[(l-\ctj\ 2 ) (3.11) 

j=o 

By (|2.17|l and induction, the coefficients of are polynomials 

a>o, ao, ■ . ■ , otj-t, (Xj-i and so, by induction, the moments c, J+ \ are polynomi- 
als in the same a's. Then H^.lljl becomes (a formula of Verblunsky) 

n-1 

Cn+1 ^n 

FT (1 - | ctj- 1 2 ) + polynomial in (a , a , a„_i, a„_i) (3-12) 

3=0 

We will now prove Theorem 13 . 1 1 bv induction and then Theorem 12.31 follows from 
Lemma l3~2l For n = 0, we have, by I3.12jl and (j3.7jl . that 

ci = a = 70 (3.13) 

Suppose we know aj — jj for j = 0, 1, . . . , n — 1. We fix those n values in B 
and ask what values of c n +i can occur. By <|3.7[1 . it is a solid disk in C of radius 
n?=o (1 ~ l a il 2 ) since 7„ can run through B. The center of the disk is some fixed 
point (given fixed {7j}"=o). 

By (|3.12() . it is also a subset of the disk of radius n^=o (1 — \ a j\ 2 ) with possibly 
another center. But since the sets are the same, the centers must be the same, and 
all ctj must occur. Once we know the centers and radii are the same, the equality 
of the formulae for c„+i implies a n = 7„. □ 

4. Zeros, the Bernstein-Szego Approximation, 
and Boundary Conditions 

Our first goal in this section is to prove that the zeros of OPUC lie in B. There 
are six proofs of this in [SSj. We pick the one that is shortest, using the same 
argument that led to (|2.19l) . 



Theorem 4.1. $ n has all its zeros in B and <£>* has all its zeros i 

Proof. (Landau [SS]) Let $„(zo) = and define P(z) — Q n (z)/(z — zq). Since 
degP = n - 1, P _L Thus 

||P|| 2 = ||zP|| 2 = \\z a P + $„|| 2 = |z | 2 ||P|| 2 + ||$„|| 2 (4.1) 

so ||$„|| 2 = (1 - |z | 2 )||P|| 2 , implying \z \ < 1. Since $* n (z ) = O $„(l/z) = 0, 
the result for implies the result for $*. □ 

Next, we will identify measures with a,-(d/i) = for j > n a . The key is a 
calculation that goes back to Erdelyi et al. [22] ■ 

Proposition 4.2. Let P n be a polynomial of degree n with all zeros in B. Let 

= 2APj^W (4 ' 2) 
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where c is picked to make d/j a probability measure. Then for all integral j < n 
(including j < 0), 

(z j ,P) L 2 ido , dfl) =0 (4.3) 

Proof. 

(zi,P) mdW = f e~^P(e ie ) ,W 



2iTZ- n P*(z)P(z)\. 



1 f z „-j-i_dz_ 



2ni J P*{z) 

is zero for n — j — 1 > since P*(z) is nonvanishing on D. □ 
Theorem 4.3. Let dfi be a nontrivial probability measure on <9B. Let 

d ^ n = 2K\<p n (e",dri\2 (4 ' 4) 
Then dfi n is a probability measure with 

, , . I aAda) j <n—l ,, 
aj(d»n)= < lK ^' . (4.5) 
[0 j > n 

Proof. Let dv = c d\i n where c is picked so that J dv — 1 (eventually, we will prove 
c = 1). By Proposition (* j dp)) L 2 {mAv) = for j = 0, 1, . . . , n - 1, 

so $ n (,2:;<if) = 4> n (z; <i/z). It follows from Theorem 12.31 that otj(dv) = aj(dfi) for 
j = 0, ...,n— 1 and (p n (z;d/i) = ip n (z;dv). Therefore, 1 = J\ip n \ 2 dv = c, so 

dv = djJL n . 

By Proposition 14. 31 for any k > 0, 

(z J ,z fc $„) L 2 (aDidMii) = j = 0,...,n + fc-l (4.6) 

It follows that Q n+ k(z;dn n ) = z k & n (z; d(j, n ) and thus, $„+fc(0) = for k > 1. 
Therefore, by l|2.18|l . aj(dfi n ) — for j > n. □ 

Even though Theorem 14.31 was proven by Verblunsky |l()3j and rediscovered 
by Geronimus [S] (to whom it is often credited), d\i n are called Bernstein-Szego 
approximations since Szego |HU first considered measures of this form (|3.2|) and 
Bernstein jllj their OPRL analog. Since, for each fixed j, aj(dpL n ) — > aj(dfx) 
(indeed, they are equal for n > j), d/i n — > d/i weakly since S is a homeomorphism. 

Some thought about the form of d/i n suggests its Caratheodory function should 
be a rational function whose denominator is ip* n . We will prove this by identifying 
the numerator. The second kind polynomials, ip n , are the OPUC for the measure 
djjL-i with otj(dfi-i) = —aj(dfi). Notice that in terms of the matrix A of 



(note a„(c?/i), not a„(c?/i_i)). Thus 

_ll l:)=A {z ,a n ^...A( Z ,a )(_{ j) (4.8) 

Taking determinants, using det(^4) = z, 

p>„ + <p n r„ = 2z n (4.9) 
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Theorem 4.4 (Verblunsky [EB1). Let dfi n be given by (|4.4(l . Then 

F(z,dfj, n ) = h—r-s ( 4 - 10 ) 



Proof. For z = e w , l|4.9|l can be rewritten as Re(ip n (e ie ) tp n (e ie )) = 1. Thus, if 
G(z) is the right side of (|4.10() . 

Re(G(e i9 )) = / (4.11) 
|<,5„( e )l 

Since G is analytic in a neighborhood of B, ReG > on D. Since G(0) = 1, the 
complex Poisson representation (see Rudin pg. 235]) and l|4.10[l imply that 
G(z) is the Caratheodory function of dfj, n . □ 

It is useful to think of dfi and g?/i_i as embedded in a family d/j>\ for A £ 9D. 
The Aleksandrov family associated to dfi is defined by 

aj(dfix) = Xaj(dn) (4-12) 

Given Geronimus' theorem (Theorem 13. 1|) . it is easy to see that 

f{z 1 dfi x ) = Xf{z,dfi) (4.13) 

(for 70 (A/) = A7o(/) and (A/)i = A(/i)). So, by (El and its inverse, zf(z) = 
(F(z)-l)/(F(z) + l), 

F(Z ' - (1 + A) + (1-A)F(^) (4 ' 14) 

which is the original definition of Aleksandrov it is Golinskii-Nevai [35] who 
realized its relevance to OPUC and boundary conditions. If tp$i\z) — (p n (z; dfj,\), 
then 

/ W \ / O) \ 

G&) -«*■*•>(&>) 

so and ip„ obey the same difference equation, but the n — boundary values 
change from (*) to Q). The Aleksandrov family is the analog of variation of 
boundary conditions in second-order ODE's. 

A direct calculation (via contour integrals) shows that if Re(a) > 0, then 

^ (l-e*°) + (l + e^a d9 

(1 + e ie ) + (1 - e ie )a 2?r 1 ' 

Since 1 is the Caratheodory function of d9/2ir, l|4.16|) and (|4.14|) imply 

Theorem 4.5 (Aleksandrov |2|, Golinskii |3H]). For the Aleksandrov family, we 
have 

WM^ = ^ (4-17) 

This is the OPUC analog of the Javrjan 02]-Wegner jl()4j averaging for 
Schrodinger operators, which is the basis of the localization proof of Simon- Wolff 
[§3"| . It can be used jEH] to prove localization for suitable random OPUC. 
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5. The CMV Matrix 

Perturbation theory involves looking at similarities of measures when their 
Vcrblunsky coefficients are close in some suitable sense. In the analogous OPRL 
situation, the Jacobi matrices, (|1.4|l . are an invaluable tool. If one defines the es- 
sential support of a measure to be the support with isolated points removed, and if 
p and 7 are measures on [c, d] C K with Jacobi parameters a n , b n and a n ,b n , then p 
and 7 have the same essential support if \a n — a n \ + \b n — b n \ — > 0. This can be seen 
by noting that the difference of the Jacobi matrices is compact and then appealing 
to Weyl's theorem on the invariance of essential spectrum. 

In this section, we discuss a suitable matrix representation for multiplication by 
z in L 2 (<9B, dp). There is an obvious choice, namely, zip m ), but this is not the 
"right" one. It has two problems. IfX)l a il 2 < oo, {(/? n }^L is not a basis, and so this 
matrix is not unitary. Even worse, this matrix has finite columns ((<p n , z(p rn ) = 
if n > m + 1) but, in general, it does not have finite rows. 

The right basis, as discovered by Cantero, Moral, and Velazquez |T2] is 
the one, xo, Xi, X2, ■ ■ ■ , obtained by orthonormalizing 1, z, z -1 , z 2 , z~ 2 , . . . . We 
will also want to consider the basis, xq, x\, x%, . . . obtained by orthonormalizing 
1, z~ , z, z~ 2 , .... Remarkably, the x' s can be expressed in terms of ip's and <p*'s, 
and the matrix elements in terms of a's and p's. 

Proposition 5.1. 

(a) X2n(z) = Z~ n ip2 n (z) X2n-l(z) = Z-'' l+1 ip 2 n~l(z) (5.1) 

(b) x 2n (z) = z~ n ip 2n (z) x 2n -i(z) = Z~ n (p^ n _ x {z) (5.2) 
Proof. In terms of the projections P\k,t] of (|2.22|l . we have 

(l-P^,!])^ . (l-ifr, m] )l 
Pm. = jj jj <P m = jj jj (5.3) 

where || . . . | is the norm of the numerator. Since multiplication by z l is unitary, 

Z (f2n = jj jj = X 2n 

proving the first half of l|5.2|l . The others are similar. □ 
We define four matrices (C = CMV matrix) by: 

Cm = (xk, zxi) Cke = (xk,zxe) Ckt = {xk,zxt) Mm = (xk,xe) (5.4) 
Clearly, 

C = CM C = MC C = C* (5.5) 

where the last comes from the fact that the explicit formulae below show C and M. 
are (complex) symmetric. Define, for a € B, the 2x2 symmetric matrix: 

e(a)=(^ J a ) (5.6) 

Theorem 5.2. Let 1 be the lxl unit matrix. Then 



M = 1 ffi 6(ai) 8 9(a 3 ) © 



C = 9(a ) © B(a 2 ) © 9(a 4 ) © 



(5.7) 
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Proof. This is an expression of the Szego recursion formula. For example, the 2n 
row (labelling rows 0, 1, 2, ... ) of £ says that zx 2n — ct2nX2n + P2nX2n+i which, by 
Proposition 15. II is equivalent to zif2 n = ^nf^n + P2n l P2n+i, which is the top row 
of fTM . 



While C and M. have direct sum structures, in general (i.e., if all |cty| 
does not. Indeed, by (|5.5jl and (|5.7(l . 

/ 



C 





aipo 


PiPo 








Po 


— tiiQo 


-piao 











a 2 pi 


—a 2 ai 


UzP2 


PZP2 





P2P\ 


— P20t\ 


—a 3 a 2 


~p 3 a 2 











OiAPz 


-014013 



\ 



\ 



□ 

< i), c 



(5.8) 



/ 



Thus C has a 4 x 2 block structure and is generally five-diagonal. It is the simplest 
unitary matrix with a cyclic vector; for example 13] . any tridiagonal semi- infinite 
unitary is a direct sum of 1 x 1 and 2x2 matrices. 

Theorem 5.3. If C w is the top left N x N block ofC, then 

det(z l-CW) = $ N (z ) (5.9) 

Sketch. If £ is the operator of multiplication by z in L 2 (dB,dp), then = 
P[-i,N-e-i](P[-i,N-e-i] restricted to raaR_.£ #-4-1] where Pu t k] is given by (|2.22(l 
and I is either (N — l)/2 or N/2. Since multiplication by z £ is unitary, 

C (N) 

is 

unitarily equivalent to P[o,n~i]CP[o.n-i] on ran -P[o.Ar-i]- 

zq is an eigenvalue of P[o } n-i](P[o,n-i] if and only if there is Q of degree N — 1 
so (z — zq)Q = $at(z), that is, if and only if &n(z ) — 0. This proves (|5.9(l if <E> n 
has distinct zeros. By a limiting argument (see Theorem 1.7.18 of |HH])i fi-ty holds 
in general. □ 

Remark. This theorem sheds light on a result of Fejer [531 that for OP's of 
general measures on C, their zeros lie in the convex hull of supp(d/i). For (|5.9() 
implies the zeros are in the numerical range of C^ N \ so the numerical range of C, 
which is the convex hull of supp(dp) by the spectral theorem. In particular, Fejer's 
theorem implies in the OPUC case that if £ € 9D with d = dist(£, supp(d^)) > 
and $„(j»o) = 0, then \z - (\ > \d 2 . 



Notice that if la,- 



1, 6(a i 



Sj 
-a, 



is a direct sum, and so C = CM has 



a {j + 1) x (J + 1) unitary block in the upper corner. This implies that if (3 £ 



then $ 



W 

N 



— f3&* N1 has all its zeros on cHD) since they are eigenvalues of 
a unitary matrix. The are called paraorthogonal polynomials and studied in 

E2IBIE3 

Dombrowski |2(Jj proved that a Jacobi matrix with liminfa n = has no a.c. 
spectrum by picking a subsequence with YlfLo a n(i) < 00 an< i trace class perturbing 
to a decoupled direct sum of finite rank matrices. Unaware of this work, Simon- 
Spencer |92j proved a similar result if limsup|6„| =00. As noted by Golinskii- 
Simon ^Hj) the same idea and CMV matrices prove the following, originally proven 
by other means |81j . 

Theorem 5.4 (Rakhmanov's Lemma |81|L If p, is a probability measure on <9B so 
Umsup|a n (d/i)| = 1, then p is singular with respect to d6/2i:. 
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Golinskii-Simon also use perturbations of CMV matrices to prove: 

Theorem 5.5 f|40|h If \x,v are two probability measures on <9ED so \a n (d/j,) — 
a n {dv)\ — ► 0, then esssup(d/i) = esssup(<i^). If ^2 n \a n (dp) — a n (dv)\ < oo, then 
the absolutely continuous parts of fj, and v are mutually absolutely continuous. 

Aleksandrov families fit into CMV matrices with a twist. C({Xa n }) and C({a n }) 
do not differ by a rank one perturbation — rather they do up to a unitary equiva- 
lence. Specifically: 

Theorem 5.6. Let A £ dD and {a n } £ D°° . Let D be the diagonal matrix with 
elements 1 , A -1 , 1, A -1 , . . . . Then DC({Aa„})L> _1 = £{{a n })M\({a n }) where Mx 
differs from M. by having A in the (1,1) position instead of 1. 

This is a restatement of Theorem 4.2.9 of [SS]. A generalization to rank one 
perturbation in the n-th diagonal can be found in Simon |90| . 

CMV matrices have been generalized in two directions. First, OPUC can be 
thought of as an analog of half-line ODE. The whole-line analog is an extended 
CMV matrix, £, defined on £ 2 (— oo,oo) by a two-sided sequence {a n }^-oo as a 
product of • • • © 6(a_ 2 ) © 6(«o) © 0(a 2 ) © • • • and ' • ■ © ©("-l) © ©("l) © • • • 
where ©(ay) acts on the span of Sj and dj+\. This is discussed in Sections 4.5, 10.5, 
and 10.16 of ISHUHHI- It is useful in the study of ergodic (SectionHJ) and periodic 
f Section Hn|l OPUC. Gesztesy-Zinchenko have further results on £. 

Second, if U is an n x n unitary matrix and <p is cyclic in that {C J V}?=o ^ s a basis, 
then the spectral measure for (p has n points, defines polynomials $0, ■ ■ ■ , an d 
Verblunsky coefficients ao, . . . , a,i_2 € U a nd a n _i € 9D. £/ is unitarily equivalent 
to a finite CMV matrix, the upper block of an infinite matrix where a n -\ is taken 
in dD. 

Just as the theory of sclfadjoint matrices with cyclic vector is identical to the 
theory of Jacobi matrices, the theory of unitary matrices with cyclic vector (i.e., 
{IJi Lp}JL_ OQ spanning) is identical to the theory of CMV matrices. The Verblunsky 
coefficients are a complete set of unitary invariants. 

In this regard, there is a natural question answered by Killip-Nenciu pU]. Let 
U(n) be the group of nx n unitary matrices and consider Haar measure on U(n). For 
a.e. U, (10... 0)* is cyclic, so there is induced a measure on Verblunsky coefficients 
ao, . . . , a n -2 £ ID and a„_i £ 83. The measure is the same if (1 . . . 0)* is replaced 
by any other vector or by a random choice (say, uniform distribution on the unit 
sphere in C"). 

Theorem 5.7 ('|50|'). Under the measure induced by Haar measure on U(n), the ctj 
are independent (i.e., the induced measure is a product measure), a n -\ is uniformly 
distributed on dD, and ctj, j = 1, . . . , n — 2, is distributed via 



6. Transfer Matrices, Weyl Solutions, and Lyapunov Exponents 

In this section, we present a potpourri of results connected with solutions of 
Szego recursion l|2.28|l where the two components are freed of ui, = u\ — indeed, 
we look at solutions for a fixed z. Thus, solutions have the form 



(n-j- 1) 



(l-\a\ 2 ) (n -i- 2 U 2 



(5.10) 



a 



u(z;n) = T n (z)u(z; 0) 



T n (z) = A(z,a n -i 



)...A(z,a Q ) 



(6.1) 
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with A given by (|2.29|) . T n is called the transfer matrix. By |4.8|l . we have 

rp i \_x( <Pn(z) + lpn(z) (fin(z) ~ 1p n {z) 

11/^ \-zA n ^{z) B n ^{z) ) 
j=o ' K ' 

where A n _i and i?„_i are degree n — 1 polynomials and the * term is *> n ~ . 
The degree count uses ip n {0) = — ^> n (0), <^(0) = V'nC 2 )- an d B« are the 
WaZZ polynomials which are related to the Schur approximants, p n \ of l|3.9[) by 
fW(z)=A n (z)/B n (z). 

For 2 e SO, T n lies in the group U(l, 1) of matrices obeying M* (J ° x ) M = 
( q ) . Features of this group play a role in advanced aspects of the theory; see 
|55|. especially Section 10.4. 

The solutions u v = (ip n , ) and u$ = (ip n , — tp^) of l|6.1|l can be combined into 
an £2 solution for \z\ < 1: 

Theorem 6.1 (Geronimo [27]; Golinskii-Nevai [3^1). Fix z e KD. TAen u^(z;n) + 
ru<p(z;n) -> os n -> 00 /or /tied r £ C if and only if r — F(z). Moreover, 
+ F(z)u v is in I 2 . 

Remark. In analogy to ODE theory, + F{z)u tp is called the Weyl solution. 

Sketch ^|39| ) . Looking at the second component, we see that if u^p + ru v —> 0, then 
-1/3* + ry* n -> 0. By J233I), r - tp*/(p* -> 0, so by (SHE) , r = That the first 

components go to zero for r = F(z) will follow from the I 2 proof. 

By using the CD formula l|2.32|l for ip and tjj plus a mixed CD formula obtained 
from (|2.31(l by using (,) and (_■,), one finds that 

n-l 

(l-|z| 2 )^|^(z)+r^-(z)| 2 -4Re(r) + |CW-^;| 2 -|^W + ^„(z)| 2 (6.3) 
3=0 



Taking r = ip^(z)/ip* n (z), one finds 



k < n - 1 V ^ + < 4Ref ^ 



3=0 

Taking n — > 00 and then k — > 00 shows 



(6.4) 



^|V J+ F^| 2 <4Re(F) (6.5) 
3=0 

The inequality in l|6.5|l plus equality in i|6.3[l imply that \ip* — Fip*\ < + Ftpj\, 

so (|6.5f) implies + Fu v € I 2 . □ 

Another way of proving the i 2 result, from |27J, is illuminating. It starts from a 
formula which was Geronimus' original definition of the second kind polynomials, 
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This and its image under the map * imply 



e <e + z ■ 



(6.7) 



Using f|y?„|d/i < 1 and (e if? + z)(e ie - z)" 1 = 1 + £^=i 2(e~ l9 z)™, we see 
the Taylor coefficients of each expression in (16.711 are bounded by 2. Since 
/ e- ike cp n (e ie )dfi(6) = for fc = 0, . . . ,n- 1, we see |F<^ n + ^„| < 2|z|"(l - ^l)" 1 , 
while |i*V* — ^*| < 2|z| n+1 (l — |z|) _1 . This proves not only an £ 2 property but 
exponential decay. 

The next issue we want to discuss is Lyapunov exponents. To understand them, 
it pays to also discuss the density of zeros, an object of independent interest. Given 
dfi, a nontrivial probability measure in dO, define the measure dv n on D to be the 
point measure which gives weight k/n to a zero of $ n of multiplicity k. On account 
of |EU for £ = 0,1,2,..., 

z i du n (z) = -Tr([C^] e ) (6.8) 
n 

which can help show that dv n sometimes has a weak limit; if it does, we say the 
limit is the density of zeros. The limit may not exist; there even exist examples 
(see Example 1.1.17 of [SS]) where the set of limit points of dv n is all measures on 
D! Here is how (|6.8|) can be used: 

Theorem 6.2 (Mhaskar-Saff [55]). If 



n-l 



lim|a„| 1 /" = r and -^1*1^0 (6.9) 

3=0 

(automatic if r < 1), then dv n converges to the uniform measure on the circle of 
radius r. 

Sketch. (See Sections 8.1 and 8.2 of for details.) An argument that exploits 
Theorem l9.1l below and the fact that |a„_i| = |$ n (0)| is the product of zeros shows 
that when the first equation in (|6.9|l holds, then all limits of dv n are concentrated 
on the circle of radius r. The second equation and (|6.8|) show that any such limit, 
dv, has J z dv = S?o for £ > 0. □ 

The other case where we know v n has a limit is ergodic families of Verblunsky 
coefficients. Let (Q, df3) be a probability measure space, T : Q — > f2, an invertible 
ergodic transformation, and V : £1 — > D. For each wgO, define a measure dfx u by 

a j (dit a ) = V(T i u) (6.10) 

An argument using the ergodic theorem, l|6.8[) . and control of \\m\a n (d^L^)\ 1 ^ n show 
that so long as J[— logF(o;)] d/3(ui) < oo, then d/i^ has for a.e. ui a limit supported 
on cW and cj-independent. The most important examples of ergodic families are 
random, periodic, almost periodic, and subshifts (see Chapters 10-12 of [EH])- 
Before leaving the subject of zeros, we note: 

Theorem 6.3 (Widom jl()6| ). If supp(d/x) is not all of dU, then for any r < 1, 
sup„(# of zeros of $ n in \z\ < r) < oo. In particular, any limit of dv n is supported 
on 0B. 
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Theorem 6.4 (see Theorem 8.1.11 of [BED- If z o in <9B is an isolated point of 
supp(d/i), there is precisely a single zero o/$„ near zq forn large and it approaches 
Zq exponentially fast. 

Finally, we discuss the Lyapunov exponent and Thouless formula: 

Theorem 6.5 (see Theorem 10.5.8 of [EH]). If the density of zeros measure, dv, 
exists and is supported on 9D, and if 

(n—X \ l/n 
II Pi ) ( 6 - n ) 
3=0 ' 

exists, then for z £ dU>, the following limit exists and is given by 

j(z) = lim - log||T n (ar)|| =-log Poo - f log\e i9 - z]- 1 dv{0) (6.12) 

n^oo n J 

7 is called the Lyapunov exponent. (|6.12(l is called the Thouless formula. For 
\z\ > 1, \<p n \ > |<p*| and \tp n \ > iV'nl) we nee d only control the growth of \ip n \ 
and \ipn\- By (|6.8[) . <p n and ijj n have the same density of zeros. Writing ip n = 

n;=o pj 1 n Z ero S (^ - ra- 

See [SSI for discussion of when (|6.12|1 holds on 9D and for further study of ergodic 
OPUC. 



7. Khrushchev's Formula, CMV Resolvents, 

AND PlAKHMANOV'S THEOREM 

In two remarkable papers [481 149| . Khrushchev found deep connections between 
Schur iterates and the structure of OPUC. A key input for the theory is: 

Theorem 7.1 (Khrushchev's Formula). The Schur function for the measure 
\(p n (e , dii)\ 2 dfi(8) is given by b n (z)f n (z), where f n is the n-th Schur iterate [by 
Geronimus ' theorem, this is the Schur function of the measure with Verblunsky 
coefficients {a n +j}f%o) an( I bn is the Blaschke product, 

b n (z;dfi) = (7.1) 
ip*(z;d(j.) 

Remark. Khrushchev's formula illuminates (|2.9[) . In this trivial measure case, 
{zj}™~l are the zeros of $n_i and e l6 ° is the Schur parameter, 7„_i. 

In terms of the CMV matrix, this gives a formula for (S n , (C + z)(C — z)~ 1 5 n ), 
and so when n = m for 

G nm {z) = (5 n , (C - z)- l 5 m ) (7.2) 

the analog of the Green's function in ODE's. Half-line Green's functions for ODE's 
have the form /_(min(cc, ?/))/+ (max(a:, y)) where /_ (resp. /+) obeys boundary 
conditions at x = (resp. x = oo). There is an analogous formula, due to Simon 
(even if n ^ m), for G nm in terms of the OPUC and Weyl solutions. It can be found 
in Section 4.4 of [HE] and generalizes Theorem 17. II Other proofs of Theorem 17.11 
appear in Theorem 4.5.10 of [88] and Theorem 9.2.4 of [HHl- The most important 
consequence of Khrushchev's formula is: 
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Theorem 7.2 (Khrushchev 0H])- The essential support of the a.c. part of d[i is 
all of <9ED if and only if 

lim /*|/»(e <9 ,d/*)| a ^=0 (7.3) 



Since 



an immediate corollary is 



2tt jfl 

/n(e ie ,^) — =/ n (0)=a„ (7.4) 

Z7T 



Theorem 7.3 (Rakhmanov's Theorem). If the essential support of the a.c. part of 
d\i is all of dO, then 

lim a n = (7-5) 



71 — >OG 



This result is originally due to Rakhmanov [HOI EH E2| with important further 
developments by Mate-Nevai-Totik [HHI EDI ESI ESI- Bello-Lopez [TOJ extended this 
result to arcs, and Denisov ^Uj to OPRL. Here are some other important results of 
Khrushchev's theory: 



Theorem 7.4. 



d6 

w-\im.\ip n (e ie )\ 2 dfi = — (\fj) lim a n+j a n = 

Z7T n—>oo 



Remark. (|6.8[1 can be reinterpreted as saying weak Cesaro limits of \(p n \ 2 d/i are 
the density of zeros when the latter is supported on SO; see Section 8.2 of |88| . 

Theorem 7.5. Let f^ be the Schur approximates [given by <|3.9|) L Then 

l/ W (e ie )-/(e ie )| 2 f -0 (7.6) 

if and only if either 

(i) dfi& c = 0, that is, /i is purely singular, or 

(ii) a n {dfj,) — > 0. 

Moreover, if w-\im\ip n (e ie )\ 2 dfi = d6/2iT, then (j7.6j) holds. 

As a consequence of these theorems, we get a result for sparse a's: 

Corollary 7.6. J/ lim moo a n+jfln — /or all j , but limsup n |a„| 7^ 0, f/ieri /i is 
purely singular continuous. 

Theorem 7.7. Suppose that uniformly on compacts of dH>, 

cf>* ( 7 \ 

lim "+;y =G(z) (7.7) 

rwoo $*( Z ) W V ; 

i/ien either G(z) = 1 or e/se /or some a e (0, 1] and A € <9B, 



G(z) = 5 [(1 + Az) + v/(l - \zf + 4a 2 Az] (7.8) 

Note we have that G = 1 if and only if linin^oo a n+ ja n = for all j and that 
Barrios-Lopez have proven that (|7.8fl holds if and only if linin^oolanl = a and 
lim n ^co otn+ia.- 1 = A. 

Khrushchev has also described all possible dv's that can occur as w-lim|</3„| 2 d/i 
(i.e., for which the limit exists) and when they can occur (essentially, asymptotically 
period 1 or 2). The analogs of these w- limit and ratio asymptotic results for OPRL 
were found by Simon |%7] . 
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8. Szego's and Baxter's Theorems 

Szego's theorems may well be the most celebrated in OPUC. While they have 
expressions purely in terms of OPUC objects, for historical reasons, one should 
state them in terms of Toeplitz determinants, D n (dfj,). This is defined as the 
determinant of the (n + 1) x (n + 1) matrix {ck-i}o<k,i<n with c given by 1(2. 8|1 . 
D n is the Gram determinant of {z }^_ since (z , z e )^2^ d ^ = Ck-i- The invariance 
of such determinants under triangular change of basis implies (using also 112.201) 1 

n n — 1 

Dn(d^) = n n%ii 2 = ri( i - \ a ^ n ~ j 

3=0 j=0 

which immediately implies 

oo 

F(d/i)= lim D n (d^ n = T[(l-\a ] \ 2 )= lim ||$„|| 2 (8.2) 

3=0 

G ^^^^^-n^-\^r^ (8.3) 

F is always defined, although it may be 0. G is defined so long as F > 0, that 
is, so long as X)j!Lol a il 2 00 ■ ^ ma y be infinite and is finite if and only if 

Szego's theorems express F and G in terms of the a.c. weight, w, of dfi: 

d9 

dfx = w{6) — + dfj, B (8.4) 

where w £ L 1 (9D, ^) and d/j, s is singular with respect to d9/2jT. 
Theorem 8.1 (Szego's Theorem). 

F(dfi) = f[(l- |a,-| 2 ) = exp( / \og(w(9)) (8.5) 

3=0 ^ J 77 ' 

Remark. Szego proved this when dfi s = in 1915; the proof below is basically 
his proof in |95|. The result does not depend on d/x s — this was shown first by 
Verblunsky QH3]. [HE1 IHHI have five proofs of Theorem IO 

Sketch when d/j, s = 0. Since <&* is nonvanishing on D and $^(0) = 1, 
Jlog|$*(e i9 )||^ = 1. Thus, by Jensen's inequality, 

||<M 2 ee J \K(e te )\M°) ^ > exp(| log(w(9)) 

so F(d/j.) > RHS of 1)8.5(1 . On the other hand, since <&* is the projection of 1 to the 
complement of [z, . . . , z n ] in [1, . . . , z n ], we have 

||$;j| 2 = min{||P|| 2 2(dM) | degP < n, P(0) = 1} (8.6) 

Using (|8.2|l and a limit argument, 

F(dfi) = mm{||/||f 2(d/j) | / € H°°, /(0) - 1} (8.7) 

Pick the trial functions f £ (z) = g £ (z) / g e (0) where 

g £ {z) = exp^-y ^— - log(u;(0) + £ )— j 
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and take e | to get F(dfi) < RHS of jHS- □ 

Because their singular continuous part is arbitrary, once an i 2 condition is 
dropped, d\i can be arbitrarily "bad" : 

Theorem 8.2. Let dp be a measure on <9B with support all of dU>. Then there exist 
dfi, a probability measure on <9B mutually equivalent to dp, so that for all p > 2, 

oo 

\a n (dp)\ p < oo (8.8) 
This is Theorem 2.10.1 of proven using ideas of Totik |1UU| and the bounds 

in 

By (I8.5|l . we get one of the gems of spectral theory, equivalences between some 
recursion coefficient property and some spectral measure property: 

Corollary 8.3. 

°° f df) 

Kf < oo o / log(w(9)) — > -oo (8.9) 

j=o J 

The equivalent conditions l|8.9|l are called the Szego condition. When they hold, 
Szego defined the Szego function by 

D{z) = exp(7 \og{w{e))^ (8.10) 

Standard boundary value theory for the Poisson kernel implies D(e l8 ) = 
lim r fi D{re l9 ) exists for d9/2ir-&.e. 9 e [0, 2tt) and 

\D(e w )\ 2 = w(9) (8.11) 

Theorem 8.4 (Szego IS]). Suppose (ESJ holds. Let D &c (e ld ) = D(e w ) for a.e. 9 
and = on a supporting set for dp, s . Then 

(i) Jw^-D^r^dp^Q (8.12) 

(ii) f\^ n (e i9 )\ 2 dp s ^0 (8.13) 



(iii) V*X Z ) ~* D(z) 1 uniform on compacts inV> (8-14) 

Sketch. A short preliminary argument proves that D £ H 2 (D). Thus the Cauchy 
formula holds for <^*-D, so 

d8 

(<p* n D)(e ie )-=<p* n (0)D(0)^l (8.15) 



since, by & <P* n (0)D(0) = n"=o (1 " l«i| 2 )" 1/2 II£o(l " Kl 2 ) 1/2 - By ED, 
1 1< - DCe^)- 1 ! 2 ^^) g + J K(e* e )| 2 dp s 

= \WV\\l* W + 1 - 2 Re (LHS of JHHSJl) - 

by (JHHSl). This implies (i) and (ii). This then implies that Dip* n -» 1 in L 2 (9D, ff ), 
so by H 2 theory, (iii) holds. □ 
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Ij8.14jl and the related 



z~ n ip n (z) -> D(l/z) on C\B (8.16) 
are called Szego asymptotics. 

Theorem 8.5 (Sharp Form of the Szego Strong Theorem HJEHHJEEl). If d(X s = 0, 
the Szego condition holds, and are the Fourier coefficients oflogw, then 



OO y CO x 

GW) = Y[(l - Kfr'' 1 = exp ]Tn|L„| 2 ) 



. „ , ( 8 - 17 ) 

Remark. Szego |98| proved this when d\i has certain regularity properties. The 
general result is due to Ibragimov see also 
Seeing when G(dfx) < oo leads to a second gem: 

Corollary 8.6. 

oo oo 

y~] j\otj\ 2 < oo <^ d(i s = and ^n|L„| 2 <oo (8.18) 

j=0 n=0 

This corollary relies also on a theorem of Golinskii- Ibragimov that the LHS 
of H8.18fl =>• dfi s — 0. This result plus five distinct proofs of Theorem 18. 51 are found 
in Chapter 6 of A sixth proof is in Section 9.10 of 89 . 

A final gem we want to mention is: 

Theorem 8.7 (Baxter's Theorem jS][!5]). Fix £ > 0. The following are equivalent: 

oo 

(a) ^2n e \a n \ < oo 

n=Q 

oo 

(b) d/i s = 0, mfw(0) > 0, and ^""^ n e \c n \ < oo 

71=0 

In particular ifd/j, s = and infg w(9) > 0, then w is C°° if and only i/sup„ n \ot n \ < 
oo for all I > 0. 

This is proven in Chapter 5 of |88| . 

9. Exponential Decay 
Suppose for some R > 1, we have 

\a n \<CR- n (9.1) 

By <|2.17[1 and induction, 

oo 

sup \$ n (z)\ < JJ(l + |a J |)<oo (9.2) 
n, |z|=l J=0 

so, by the maximum principle and 12.15fl . 

sup \z\- n \<$> n (z)\ = sup \$*(z)\ < oo 
M>1 kl<i 

Thus, by f2~27jl . if |z| < i?, 

OO OO 

n=0 n=0 
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It follows that $n(z) and so ^>n(z) converges uniformly on compacts of {z \ \z\ < R} 
and so, by l|8.14|l . D(z)^ 1 has a continuation to this disk. We thus have one-half 
of: 

Theorem 9.1 (Nevai-Totik 67]). Fix R > 1. The following are equivalent: 

(a) dfi s = 0, the Szego condition holds, and D(z)" 1 has an analytic continuation 
to {z | \z\ < R}. 

(b) limsup n ^ 0o |a n | 1 /™= J R-i 

The other direction uses the useful formula, 

din = a n = -DiO)- 1 J * n+ i(e i9 ) D(e l9 )~ l dfi(9) (9.4) 

Section 7.1 of 88 has a complete proof. One can say more ( 88, Section 7.2]) 
when this holds: a n + Taylor coefficients of D(z)~ 1 D(l/z) decays as 0(R~ 2n+£ ). 
There is also a lot known about asymptotics of the zeros when there is exponential 
decay (see jSSl Sections 8.1 and 8.2], [iniEHI and references therein). 

10. Periodic OPUC 

The theory of one-dimensional periodic Schrodinger operators (a.k.a. Hill's equa- 
tion) and of periodic Jacobi matrices has been extensively developed [21 [221 H2I 
El ED Enj. In the 1940's, Geronimus (301 found the earliest results on OPUC with 
periodic Verblunsky coefficients, that is, for some p > 1 and j = 0,1,2,..., 

a j+p = ctj (10.1) 

In particular, the case a 3 • = a G D\{0} yields OPUC called Geronimus polynomials 
(see Example 1.6.12 of |88|1. Many of the general features for OPUC obeying 
l|Tin|l were found by Peherstorfer and collaborators |SHl EHI EDI EU C21 E31 C31 ESI 
I7fil 1771 IT8"| . Geronimo- Johnson |28l l2"9"| have studied almost periodic Verblunsky 
coefficients. A reworking with some new results is Chapter 11 of jHHl) which uses 
methods mimicking the periodic Hill-Jacobi theory. 

We suppose henceforth that p is even. A basic object is the discriminant, 

A(z) = Tt(z-p/ 2 T p (z)) (10.2) 

where T p (z) is the transfer matrix given by l|6"2|) . The z'P/ 2 is included 
since, by det(^4) = z, det(z~ p / 2 T p (,z)) = 1, and so z~ p / 2 T p {z) has eigenvalues 

Y ± i*J 1 — ( y) 2 - I n particular, these eigenvalues have magnitude 1, that is, 
sup m ||T mp (z)|| < oo exactly when A(z) e [—2,2]. This is part of: 

Theorem 10.1. There exists {xj} 2 ^ 1 , {yj}^i with 

x% < ?/i < x 2 < y 2 < ■ ■ ■ < x P < V P < x\ + 2ir = x p+1 

so that the solutions of A(z) = 2 (resp. —2) are exactly e lXl , e ly2 , e™ 3 , . . . , e lXp (resp. 
e iy \e lX2 ,e ly3 ,...,e ly v) and A(z) e [-2,2] exactly on 

B=\J{e ie \x j <9 j <y j } (10.3) 

3=1 

the bands. B is the essential support of d^ ac and the only possible singular spectrum 
are mass points which can occur in open gaps (i.e., nonempty sets of the form 
{e ie | yj < 9 < Xj + i}) with one (or zero) mass point in each gap. 
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Theorem 10.2. Let dp be the equilibrium measure for B (i.e., the minimizer for 
E(p) = J \og\z — u>| -1 dp(z) dp(w) with supp(dp) C B and p(B) = 1). Let Cb be 
the logarithmic capacity of B (i.e., exp(— minimizing value of£(p))) and Q(z) the 
logarithmic potential for B (i.e., Q(z) — J log|z — uj\~ 1 dp(oj)). Then 

(i) dp is the density of zeros for dp, — [Q(z] + logCs] is the Lyapunov exponent, 

a^C B = n^o(lH«,l 2 ) 1/2p - 

(ii) dp can be written in terms of the discriminant as 

, , A . 1 \dA(e w )/de\ d9 . . 

d P ( 9 ) = ~ 77 hr, vmi/9 77- 10 " 4 

w p (4- A 2 (e l9 )) 1 / 2 2vr v ; 

(iii) For each j = 1, 2, . . . ,p, 

p({e i6 \xj<6< y 3 }) = ~ (10.5) 

The proof of this result (see Section 11.1 of |SS]) depends on noting that, by the 
Thouless formula, -f(z) is harmonic on C\B and 7(2) = on B. (iii) is related 
to half of the following result of Peherstorfer motivated by an OPRL result of 
Aptekarev 

Theorem 10.3 (Peherstorfer |HE1)- Let B be a union of t disjoint, closed intervals, 
Si, ... , Bi, in 83. Then B is the set of bands of a period p set of a 's if and only if 

(1) If dp is the equilibrium measure of B, then pp(Bj) G Z for j = 1, . . . ,£. 

(2) Let Zi, Z2, ■ ■ ■ , zi v be defined clockwise around the circle so that z\ is the lower 
edge of B\ and the 2p points are the 2£ band edges and those interior points in 
a band with pp(Bj) > 2, that divide Bj into pp(Bj) sets with p-measure 1/p, 
each counted twice. Then ziz^z^z^Zg . . . Z2 P = 1. 

If some p(Bj) is irrational, then there is no periodic family of a's with those 
bands, but there is an almost periodic set, as proven by Geronimo- Johnson |29| 
(see Section 11.8 of [55] 1. 

Given a measure dp on 9D so that (|10.1|) holds, the Dirichlet data is defined 
partly as the p points where (.J is an eigenvector for T p (z), that is, zeros of <fip(z) — 
(Pp(z). There is one such point in each gap, including closed gaps (i.e., e lyj when 
jjj = Xj + i). If the value is at a gap edge, the eigenvalue, A, of z~ p / 2 T p (z) for 
(J) is ±1. Otherwise, it is in K\{0, — 1, 1}. In that case, we add Oj — ±1 to the 
j-th Dirichlet point with Oj = +1 (rcsp. —1) of the eigenvalue, |Aj| < 1 (resp. 

I A j j > 1). The point masses of dp are precisely those Dirichlet points inside gaps 

with o~j = +1. The set of allowed Dirichlet data are single points for closed gaps 
and a circle ([yj, Zj+i] X { — 1, 1} glued at the ends) for open gaps. Thus, the totality 
is a torus of dimension t = # of open gaps. 

Theorem 10.4. If A has i open gaps, then the subset of {aj}^Zo <= IS P which, 
when periodized, have discriminant A is a torus of dimension i. The map from 
these a 's to the possible Dirichlet date is a bijection. 

Critical to at least one understanding of this result is that the Caratheodory 
function, F, has a minimal degree meromorphic continuation to the genus £ — 1 
hyperelliptic Riemann surface associated to %/A 2 — 4. 

There is a natural symplectic form on ID P so that the real and imaginary parts 
of the coefficients of A are the set of integrals of a completely integrable system; 
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this is described in Section 11.11 of J>H! and in [HI]. The associated flows include 
the defocusing AKNS flow. 

11. The Szego Mapping and the Geronimus Relations 

Finally, we discuss a deep connection between OPRL and OPUC found by Szego 
|96j . The map z i— > z + z^ 1 maps D biholomorphically to C U {oo} with a cut 
[—2,2] removed. The map on the boundary, e lS — > 2 cos 9, is a two-to-one map 
of dD to [—2,2] that induces a map from M.+,i([— 2, 2]) to those measures on SO 
which are invariant under complex conjugation. It is easy to see p G A^+,i(9B) 
has such invariance if and only if its Verblunsky coefficients are real. Explicitly, p, 
a probability measure on [—2,2], is associated to p = Sz(p), an even probability 
measure on dH, via 



f(x)dp(x) = J f (2 cos 6) dfi(9) (11.1) 

Szego found the OPRL, P n , for p in terms of the OPUC, $„, for p: 

= ll-a 2 n-i(dp)}- 1 z- n [$2n{z) + $* 2n {z)} (11.2) 

and used this to convert Szego asymptotics for OPUC (see HS.lfijl ) to asymptotics 
for suitable OPRL. This asymptotics is often called Jost asymptotics in the discrete 
Schrodinger literature. 

Geronimus found the relation between the Jacobi parameters {<2n>&n}5£Li 
for p and the Verblunsky coefficients {a„}^L for p (with a_i = 1): 

a n+l = (1 - a 2n-l)(l - Otl n )(l + «2n+l) (H- 3 ) 
bn+1 = (1 - a2n-l)0!2ri — (1 + OL2n-\)C±2n-2 (H- 4 ) 

The map from a to (a, 6) is local, that is, changing a single a only changes a 
finite number of a's and 6's. That is not true for the inverse. Scaled Chebyshev 
polynomials of the first kind has a\ = a n = 1 (n > 2), b„ — 0, and the 
corresponding a n = 0. Scaled Chebyshev polynomials of the second kind have 
a n = 1, b n = (i.e., they differ at a single a n ), but have ct2n = and ot2 n -l = 
-l/(n+l). 

Still the inverse can be computed ([2]). Given {a n , b n }^ =1 , define <p^ by ipo = 0, 
ipi = 1, and for n > 1, 

+ «n-l^-l + &n<Pn = ±2<^t (11.5) 

By a Sturm oscillation theorem, {a n , b n }^ = i are the Jacobi parameters of a measure 
supported on [—2,2] if and only if cp+ > and (— l) n <p^ > 0. The Verblunsky 
coefficients are given by 



Vn+2 fn+2 
V n = = — 

VZ+1 <Pn+l 



(11.6) 



Ci2n 



; — - a 2n -i = I ~ h (u n + v n ) (H-7) 

Recently, these mappings have been used by Denisov ^5] and Damanik-Killip 
[171 |HE] as a powerful tool in the study of discrete Schrodinger operators and of 
OPRL. For proofs and references, see Chapter 13 of |89j . 
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